A New Label Maximization Based Incremental Neural Clustering Approach: Application to Text Clustering
Identifieur interne : 002786 ( Main/Exploration ); précédent : 002785; suivant : 002787A New Label Maximization Based Incremental Neural Clustering Approach: Application to Text Clustering
Auteurs : Jean-Charles Lamirel [France] ; Raghvendra Mall [Inde] ; Shadi Al Shehabi ; Ghada SafiSource :
- Lecture Notes in Computer Science [ 0302-9743 ]
Abstract
Abstract: Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual dataset. This is especially true for the recent adaptive versions of these algorithms, like the incremental growing neural gas algorithm (IGNG) and the label maximization based incremental growing neural gas algorithm (IGNG-F). In this paper we highlight that there is a drastic decrease of performance of these algorithms, as well as the one of more classical algorithms, when a heterogeneous textual dataset is considered as an input. Specific quality measures and cluster labeling techniques that are independent of the clustering method are used for the precise performance evaluation. We provide variations to incremental growing neural gas algorithm exploiting in an incremental way knowledge from clusters about their current labeling along with cluster distance measure data. This solution leads to significant gain in performance for all types of datasets, especially for the clustering of complex heterogeneous textual data.
Url:
DOI: 10.1007/978-3-642-21566-7_26
Affiliations:
- France, Inde
- Grand Est, Lorraine (région)
- Nancy, Vandœuvre-lès-Nancy
- Centre national de la recherche scientifique, Laboratoire lorrain de recherche en informatique et ses applications, Synalp (Loria), Université de Lorraine
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001799
- to stream Istex, to step Curation: 001780
- to stream Istex, to step Checkpoint: 000686
- to stream Main, to step Merge: 002828
- to stream Main, to step Curation: 002786
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A New Label Maximization Based Incremental Neural Clustering Approach: Application to Text Clustering</title>
<author><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation><country>France</country>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author><name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</author>
<author><name sortKey="Al Shehabi, Shadi" sort="Al Shehabi, Shadi" uniqKey="Al Shehabi S" first="Shadi" last="Al Shehabi">Shadi Al Shehabi</name>
</author>
<author><name sortKey="Safi, Ghada" sort="Safi, Ghada" uniqKey="Safi G" first="Ghada" last="Safi">Ghada Safi</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:66EFA7567D07A66E08F6541D90A5AA61F40F9C50</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-21566-7_26</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HCB-MQLX3TR0-8/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001799</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">001799</idno>
<idno type="wicri:Area/Istex/Curation">001780</idno>
<idno type="wicri:Area/Istex/Checkpoint">000686</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000686</idno>
<idno type="wicri:doubleKey">0302-9743:2011:Lamirel J:a:new:label</idno>
<idno type="wicri:Area/Main/Merge">002828</idno>
<idno type="wicri:Area/Main/Curation">002786</idno>
<idno type="wicri:Area/Main/Exploration">002786</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A New Label Maximization Based Incremental Neural Clustering Approach: Application to Text Clustering</title>
<author><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, Vandoeuvre-lés-Nancy</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
<placeName><settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="team" n="7">Synalp (Loria)</orgName>
<orgName type="lab">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="EPST">Centre national de la recherche scientifique</orgName>
</affiliation>
</author>
<author><name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
<affiliation wicri:level="1"><country xml:lang="fr">Inde</country>
<wicri:regionArea>Center of Data Engineering, IIIT Hyderabad, NBH-61, Hyderabad, Andhra Pradesh</wicri:regionArea>
<wicri:noRegion>Andhra Pradesh</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Inde</country>
</affiliation>
</author>
<author><name sortKey="Al Shehabi, Shadi" sort="Al Shehabi, Shadi" uniqKey="Al Shehabi S" first="Shadi" last="Al Shehabi">Shadi Al Shehabi</name>
<affiliation></affiliation>
<affiliation><wicri:noCountry code="no comma">E-mail: shadialshehabi@gmail.com</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Safi, Ghada" sort="Safi, Ghada" uniqKey="Safi G" first="Ghada" last="Safi">Ghada Safi</name>
<affiliation></affiliation>
<affiliation><wicri:noCountry code="no comma">E-mail: ghada1@scs-net.org</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s" type="main" xml:lang="en">Lecture Notes in Computer Science</title>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual dataset. This is especially true for the recent adaptive versions of these algorithms, like the incremental growing neural gas algorithm (IGNG) and the label maximization based incremental growing neural gas algorithm (IGNG-F). In this paper we highlight that there is a drastic decrease of performance of these algorithms, as well as the one of more classical algorithms, when a heterogeneous textual dataset is considered as an input. Specific quality measures and cluster labeling techniques that are independent of the clustering method are used for the precise performance evaluation. We provide variations to incremental growing neural gas algorithm exploiting in an incremental way knowledge from clusters about their current labeling along with cluster distance measure data. This solution leads to significant gain in performance for all types of datasets, especially for the clustering of complex heterogeneous textual data.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
<li>Inde</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement><li>Nancy</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName><li>Centre national de la recherche scientifique</li>
<li>Laboratoire lorrain de recherche en informatique et ses applications</li>
<li>Synalp (Loria)</li>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree><noCountry><name sortKey="Al Shehabi, Shadi" sort="Al Shehabi, Shadi" uniqKey="Al Shehabi S" first="Shadi" last="Al Shehabi">Shadi Al Shehabi</name>
<name sortKey="Safi, Ghada" sort="Safi, Ghada" uniqKey="Safi G" first="Ghada" last="Safi">Ghada Safi</name>
</noCountry>
<country name="France"><region name="Grand Est"><name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</region>
<name sortKey="Lamirel, Jean Charles" sort="Lamirel, Jean Charles" uniqKey="Lamirel J" first="Jean-Charles" last="Lamirel">Jean-Charles Lamirel</name>
</country>
<country name="Inde"><noRegion><name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</noRegion>
<name sortKey="Mall, Raghvendra" sort="Mall, Raghvendra" uniqKey="Mall R" first="Raghvendra" last="Mall">Raghvendra Mall</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002786 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002786 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:66EFA7567D07A66E08F6541D90A5AA61F40F9C50 |texte= A New Label Maximization Based Incremental Neural Clustering Approach: Application to Text Clustering }}
This area was generated with Dilib version V0.6.33. |